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This article provides a central limit theorem for a consistent estimator of population eigenvalues with 
large multiplicities based on sample covariance matrices. The focus is on limited sample size situations, 
whereby the number of available observations is known and comparable in magnitude to the observation 
dimension. An exact expression as well as an empirical, asymptotically accurate, approximation of the 
■ limiting variance is derived. Simulations are performed that corroborate the theoretical claims. A specific 

' application to wireless sensor networks is developed. 
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I. Introduction 



Problems of statistical inference based on M independent observations of an A^-variate random variable 
y, with E[y] = and E[yy^] = R^r have drawn the attention of researchers from many fields for years: 
Portfolio optimization in finance 12, gene coexistence in biostatistics flU, channel capacity in wireless 
communications 0, power estimation in sensor networks [4], array processing Q, etc. 

In particular, rettieving spectral properties of the population covariance matrix R^v, based on the 
observation of M independent and identically distributed (i.i.d.) samples . . . ,y^ M \ is paramount 
to many questions of general science. If M is large compared to N, then it is known that almost surely 
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||RiV — Rjv|| — > 0, as M — > oo, for any standard matrix norm, where Rjv is the sample covariance matrix 
^ N ~ "H Em=i y( m )y( m )- fl '. However, one cannot always afford a large number of samples, especially 
in wireless communications where the number of available samples has often a size comparable to the 
dimension of each sample. In order to cope with this issue, random matrix theory [6], [7 ] has proposed new 
tools, mainly spurred by the G-estimators of Girko JSJ. Other works include convex optimization methods 
J9), ifTOl and free probability tools ifTTTl . |[T2l . Many of those estimators are consistent in the sense that 
they are asymptotically unbiased as M, N grow large at the same rate. Nonetheless, only recently have 
techniques been unveiled which allow to estimate individual eigenvalues and functionals of eigenvectors 
of R. The main contributor is Mestre |[T3l - |[T4l who studies the case where R^ = UatDatU^ with D^r 
diagonal with entries of large multiplicities and U/v with i.i.d. entries. For this model, he provides an 
estimator for every eigenvalue of R with large multiplicity under some separability condition, see also 
Vallet et al. ITT31 . Couillet et al. H for more elaborate models. 

These estimators, although proven asymptotically unbiased, have nonetheless not been fully character- 
ized in terms of performance statistics. It is in particular fundamental to evaluate the variance of these 
estimators for not-too-large M, N. The purpose of this article is to study the fluctuations of the population 
eigenvalue estimator of |[T4l in the case of structured population covariance matrices. A central limit 
theorem (CLT) is provided to describe the asymptotic fluctuations of the estimators with exact expression 
for the variance as M, N tend to infinity . An empirical approximation, asymptotically accurate is also 
derived. 

The results are applied in a cognitive radio context in which we assume the co-existence of a licensed 
(primary) network and an opportunistic (secondary) network aiming at reusing the bandwidth resources 
left unoccupied by the primary network. The eigenvalue estimator is used here by secondary users to 
estimate the transmit power of primary users, while the fluctuations are used to provide a confidence 
margin on the estimate. 

The remainder of the article is structured as follows: In Section [TTJ the system model is introduced 
and the main results from |fT3l , |[T4l are recalled. In Section [Till the CLT for the estimator in |[T4l is 
stated with the asymptotic variance. In Section JVJ an empirical approximation for the variance is derived. 
A cognitive radio application of these results is provided in Section |Vj with comparative Monte Carlo 
simulations. Finally, Section [VT] concludes this article. Technical proofs are postponed to the appendix. 
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II. Estimation of the population eigenvalues 

A. Notations 

In this paper, the notations s,x, M stand for scalars, vectors and matrices, respectively. As usual, ||x|| 
represents the Euclidean norm of vector x and ||M|| stands for the spectral norm of M. The superscripts 
(•) T and (-) H respectively stand for the transpose and transpose conjugate; the trace of M is denoted by 
Tr(M); the mathematical expectation operator, by E. If x is a N x 1 vector, then diag(x) is the N x N 
matrix with diagonal elements the components of x. If z G C, then 3t(z) and ^s(z) respectively stand 
for z's real and imaginary parts, while i stands for a/^T; z stands for z's conjugate and 5k£ is denoted 
as Kronecker's symbol (whose value is 1 if k = £, otherwise). 

If the support S of a probability measure over R is the finite union of closed compact intervals for 
1 < k < L, we will refer to each compact interval as a cluster of S. 

If Z G £ NxN i s a nonnegative Hermitian matrix with eigenvalues 1 < i < N), we denote in the 
sequel by eig(Z) = 1 < i < N} the set of its eigenvalues and by F z the empirical distribution of 
its eigenvalues (also called spectral distribution of Z), i.e.: 

1 N 

i=l 

where 5 X stands for the Dirac probability measure at x. 

d y 

Convergence in distribution will be denoted by — >, in probability by — >; and almost sure convergence, 

. a.s. 

by >. 

B. Matrix Model 

Consider a N x M matrix Xat = (Xij) whose entries are independent and identically distributed 
(i.i.d.) random variables, with distribution CN(0, 1), i.e. X; L j = U + iV, where U, V are both i.i.d. 
real Gaussian random variables N(0, \). Let be a N x N Hermitian matrix with L (L being fixed) 
distinct eigenvalues p\ < ■ ■ ■ < pl with respective multiplicities Ni, ■ ■ ■ , Nl (notice that Ylf=i Ni = AO- 
Consider now 

The matrix = [yi,--- ,yju] is the concatenation of M independent observations [yi,--- ,Ym], 

1/2 

where each observation writes y^ = R^y Xj with = [xi,--- ,xm). In particular, the (population) 
covariance matrix of each observation yj is R^ = Ey^y^. In this article, we are interested in recovering 
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information on R^v based on the observation 

which is referred to as the sample covariance matrix. 

It is in general a complicated task to infer the spectral properties of Rjv based on R^r for all finite 
N, M. Instead, in the following, we assume that N and M are large, and consider the following asymptotic 
regime: 

Assumption 1 (Al): 

N, M -> oo , with — -> c G (0, oo) , and ->• q 6 (0, oo) , 1 < i < L. (1) 

This assumption will be shortly referred to as N, M — > oo. 
Assumption 2 (A2): 

We assume that the limiting support § of the eigenvalue distribution of Rjv is formed of L compact 
disjoint subsets (cf. Figure [[)). Following Ifl4l . one can also reformulate this condition in a more analytic 
manner: The limiting support of R^ is formed of L clusters if and only if for i £ {1, .., L}, mf^T^ — 
^at(z)} > 0, where 



i?Zr=lNr\^) m=l, 

max { i Yi= i N r {-^-) 2 , & Er=l Nr 2 } 1 < m < L, 

where «!<•••< are L — 1 different ordered solutions to the equation 



L 



,2 



-Viv r 7 p; - =o. 

This condition is also called the separability condition. 

Figure Q] depicts the eigenvalues of a realization of the random matrix R^r and the associated limiting 
distribution as N, M grow large, for p\ = I, p% = 3, p3 = 10 and N = 60 with equal multiplicity. 

C. Mestre 's Estimator of the population eigenvalues 

In lfl4l . an estimator of the population covariance matrix eigenvalues (p^; 1 < k < L) based on the 
observations Rjv is proposed. 
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Fig. 1. Empirical and asymptotic eigenvalue distribution of Rat for L = 3, pi = 1, pa = 3, ps = 10, N/M = c = 0.1, 

N = 60, Ni = N 2 = N 3 = 20. 



Theorem 1: IU41 Denote by Ai < • • • < Ayv the ordered eigenvalues of R^r- Let M, N — > oo in the 
sense of the assumption (Al). Under the assumptions (A1)-(A2), the following convergence holds true: 



Pk ~ Pk — ► , (2) 

M,N— >oo 



where 

M 



Pl~ ~ ~ ^2 \^rn-Pm) , (3) 



fe-1 



with N fc = {ZjZi Nj + 1,..., £ -=i ^} and Ai < • • • < Aw the (real and) ordered solutions of: 



V A m M 



(4) 



D. Integral representation of estimator pf. - Stieltjes transforms 

The proof of Theorem Q] relies on random matrix theory, and in particular, iPToll . iTTTl use as a key 
ingredient the Stieltjes transform. 

The Stieltjes transform nif of a probability distribution P over M + is a C- valued function defined by: 

m ¥ (z) = / ^1 , z G c\M+ . 
Jr+ A — z 

There also exists an inverse formula to recover the probability distribution associated to a Stieljes 
transform: Let a < b be two continuity points of the cumulative distribution function associated to 
P, then 
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P([o,6l) = -limQf 

7T ylO 



mp(x + iy)dx 



In the case where F z is the spectral distribution associated to a nonnegative Hermitian matrix Z E 
^ xAr with eigenvalues 1 < i < N), the Stieltjes transform mz of F z takes the particular form: 

F z (d\) 



m z {z) 



X-z 
N 



w ^ 6 - z jv 

1=1 

which can be seen as the normalized trace of the resolvent (Z — zIn)~ . Since the seminal paper of 
Marcenko and Pastur [16], the Stieltjes transform has proved to be extremely efficient to describe the 
limiting spectrum of large dimensional random matrices. 

In the following, we recall some elements of the proof of Theorem [T] necessary for the remainder of 
the article. The first important result is due to Bai and Silverstein lfl7l (see also |[T6l ). 

Theorem 2: H71 Denote by F R the limiting spectral distribution of K N , i.e. F R (<i A) = Y^k=i ^^pk (d A). 
Under the assumption (Al), the spectral distribution F Rjv of the sample co variance matrix R^r converges 
(weakly and almost surely) to a probability distribution F as M, N — > oo, whose Stieltjes transform m(z) 
satisfies: 

. . 1 . . / 1\ 1 

mlz) = -m(z) — 1 — , 

c \ cj z 

for z € C + = {z € C, ^s(z) > 0}, where m(z) is defined as the unique solution in C + of: 

t 



m(z) = -[z-c I "— rdF R (t) 

Note that m(z) is also a Stieltjes transform whose associated distribution function will be denoted F_, 
which turns out to be the limiting spectral distribution of F— « where Rjy is defined as: 

Denote by fn-^ N {z) and (z) the Stieltjes transforms of F Rn and F— «. Notice in particular that 

f \ M ( \ (^ M \ l 



Remark 1: This relation associated to (01) readily implies that (/ij) = 0. Otherwise stated, the 
Uj's are the zeros of . This fact will be of importance in the sequel. 
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Denote by miy(z) and m N (z) the finite-dimensional counterparts of m(z) and m(z), respectively, 
defined by the relations: 

m N (z) = -(z-—l- t ——-dF RN (t) 

M . . ( M\\ 
m N {z) = —m N (z) - (^1 - — j - . 

It can be shown that tun and m N are Stieltjes transforms of given probability measures Fn and F_ N , 
respectively (cf. Theorem 3.2]). 

With these notations at hand, we can now derive Theorem Q] By Cauchy's formula, write: 



N k 2mJ Tk \N^ r . 



/•'/,• ~ f> ( — > , N r W _ w dw 



where T k is a negatively oriented contour taking values on C \ {pi, ■ ■ ■ ,pl} and only enclosing p k . 
With the change of variable w = an d the condition that the limiting support § of the eigenvalue 

distribution of Rjy is formed of L distinct clusters (Sfe, 1 < k < L) (cf. Figure [B, we can write: 

M f m' N (z) , 
Pk = f z=^-dz , 1 < k < L (5) 

2iirN k J ek m N {z) 

where and denote negatively oriented contours which enclose the corresponding clusters and 
respectively. Defining 

M f m 'fi ( z ) 
2iriN k J Q m A (z) 

dominated convergence arguments ensure that p k — p k — > 0, almost surely. The integral form of p k can 
then be explicitly computed thanks to residue calculus, and this finally yields ©. 

The main objective of this article is to study the performance of the estimators (p k , 1 < k < L). More 
precisely, we will establish a central limit theorem (CLT) for (M(p k — p k ), 1 < k < L) as M, N — > oo, 
explicitly characterize the limiting covariance matrix = (0fc^)i<fc^<L, and finally provide an estimator 
for 0. 

III. Fluctuations of the population eigenvalue estimators 

A. The Central Limit Theorem 

The main result of this article is the following CLT which expresses the fluctuations of (p k , 1 < k < L). 
Theorem 3: Under the assumptions (A1)-(A2) and with the same notations: 

(M( Pk -p k ), 1 < k < L) x „ H x (0, 0) , 

M,N— >oo 

where Nx, refers to a real L-dimensional Gaussian distribution, and is a L x L matrix whose entries 
Q k e are given by CO), where Q k and Qi are defined as before (cf. Formula ©). 
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1 



47T 2 CfcQ 



m'(z-i )m / (za) 1 
(m(zi) - m(z 2 )) 2 (21 - Z2) 1 



1 



m(zi)m(z2) 



dz\ dz 



\az2 . 



(7) 



B. Proof of Theorem \3\ 

We first outline the main steps of the proof and then provide the details. 
Using the integral representation of pk and pk, we get: Almost surely, 

M 2 I ( m A N ^ m' N (z) 



M(p k - p k ) 



2-KiNk J Gk ~ \ m A N ( z ) IRn( z ) 
Denote by C(6&,C) the set of continuous functions from to C endowed with the supremum norm 
IMloo = su Pe fc \ u \- Consider the process: 

(Xn,X' n ,un,u' n ) : Cfc — > C 4 

where 

X N (z) = M (m^(z) - m N {z)\ , 
X' N (z) = M (m'^(z) - m' N (z)^ , 
ua^z) = rriA (z) , u' N (z) = mL (z) . 

— —N ±~.N 

Then due to 'no eigenvalue' result (cf. ifTBl , see also Proposition [j}, (Xn,X' n ,un,u' n ) almost surely 
belongs to C(Cfc,C) and M(f>k — Pk) writes: 

/ m N (z)X' N (z) - u' N (z)X N (z) \ ^ 



dz 



M 



fc Je 



m N (z)u N (z) 



A 



Tn(Xn, X n , Un, u N ) , 



where 



Tjv(a?, x' , u, v!) 



M f fm N (z)x'{z) ~ u'(z)x(zy. 1 
2-KiNk J efc V m N (z)u(z) 



(8) 



If needed, we shall explicitly indicate the dependence in the contour and write Tjv(x, x', u, u', Sf.). 
The main idea of the proof of the theorem lies in three steps: 

(i) To prove the convergence in distribution of the process (Xjv, X' N , u^, u' N ) to a Gaussian process. 

(ii) To transfer this convergence to the quantity Tn(Xn, X' n , ujv, u' n ) with the help of the continuous 
mapping theorem |fl9l . 
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(iii) To check that the limit (in distribution) of Tn(Xn, X' n , un, u' n ) is Gaussian and to compute the 
limiting co variance between Tn(Xn, X' n , un, u' n , C^) and Tn(Xn, X' n , un, u' n , Gi). 
Remark 2: Note that the convergence in step (i) is a distribution convergence at a process level, hence 
one has to first establish the finite dimensional convergence of the process and then to prove that the 
process is tight over Cfc. Tightness turns out to be difficult to establish due to the lack of control over 
the eigenvalues of whenever the contour crosses the real line. In order to circumvent this issue, we 
shall introduce, following Bai and Silverstein [20], a process that approximates Xn and X' N . 
Let us now start the proof of Theorem [3] 
Lemma 1: Under the assumptions (A1)-(A2), the process 

(Xn,X' n ) : Q k — > C 4 

converges in distribution to a Gaussian process (X, Y) with mean function zero and covariance function: 

cov(X(z),X{z)) 

cov(Y(z),X(z)) 
cav(X(z),Y(z)) 
cav(Y(z),Y(z)) 

Lemma Q] is the cornerstone to the proof of Theorem [3] The proof of Lemma Q] is postponed to Appendix 
|B1 and relies on the following proposition, of independent interest: 

Proposition 1: Under the assumptions (A1)-(A2) and denote by S the support of the probability 
distribution associated to the Stieltjes transform m. Then, for every e > 0, i € N*: 

P ( sup d(A,S) > e ) = 

\Aeoig(R N ) / 

where d(A, S) = inf^gg |A — x\. 

The proof of Proposition Q] is postponed to Appendix |A] 

As (un,u' n ) — — — > (m,m'), a straightforward corollary of Lemma Q] yields the convergence in 

N,M->oo 

distribution of (Xff, X' N , u^, u' n ) to (X,Y,m,mf). This concludes the proof of step (i). 

A direct consequence of Lemma Q] yields that (Xn, X' n , un, u' n ) : Cfc — > C 4 converges in distribution 
to the Gaussian process {X,Y,m,m') defined as before. We are now in position to transfer the con- 
vergence of (Xn,X' n ,un,u' n ) to Tn(Xn, X' n , un, u' n ) via the continuous mapping theorem, whose 
statement as expressed in lTT9l is reminded below. 



m [zjm \z) 



1 



(m{z) — m{z)) 2 (z — z) 2 
dzdz 



-k(z, z) . 



k(z, z) 



(9) 
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Proposition 2 (cf. / [79l 77?. 4.27]): For any metric spaces Si and 52, let £, (£ n )n>i be random elements 

D 

in Si with £ n > £ and consider some measurable mappings /, (f n )n>i- Si h-> §2 and a measurable 

71— >00 

set rc5i with £ G T a.s. such that /„(s„) -> /(s) as s„ -> s G T. Then / n (£ n ) 13 > /(£). 

It remains to apply Theorem [2] to the process (Xn, X' n , un, u' n ) and to the function Tjy as defined 
in ©. Denote b>L! 

1 /" f m(z)y(z) -w(z)x(z)\ 
T(x,y,v,w) = — — f z[ , 

and consider the set 

(x,y,v,w) G C 4 (e fc ,C) , inf |u| > 



Then, it is shown in ||6l Section 9.12.1] that infg fe |m| > 0, and, by a dominated convergence theorem 
argument, that (xjy,yN,viy,WN) — > (x,y,v,w) G T implies that T]y(xiy,yN,VN,'WN) — > T(x,y,v,w). 
Therefore, Theorem [2] applies to Tn(xn,Vn,vn,wn) and the following convergence holds true: 

Yn(Xn,X' n ,u n ,u' n ) — > T(X,Y,m,m') , 

M,N—toc 

and step (ii) is established. 

It now remains to prove step (iii), i.e. to check the Gaussianity of the random variable T(X, Y,m,mf) 
and to compute the covariance between T(X, Y, m, m', Ck) and T(X,Y,m,m' , Qg). 

In order to propagate the Gaussianity of the deviations in the integrands of ([6]) to the deviations of 

the integral which defines pk, it suffices to notice that the integral can be written as the limit of a finite 

Riemann sum and that a finite Riemann sum of Gaussian random variables is still Gaussian. Therefore 

M(pk — Pk) converges to a Gaussian distribution. As inf zg e fc |n*(*OI > 0> a straightforward application 

of Fubini's theorem together with the fact that E(X) = E(Y) = yields: 

/ / m'(z)X(z) Y(z)\ 
E f z ~ h V " z ~ri )dz = 0. 

It remains to compute the covariance between T(X, Y, m, rn', Gk) and T(X, Y, m, m', C/) for possibly 
different contours and C^. We shall therefore evaluate, for 1 < k,£ < L: 

@U = K(T(X,Y,m,m',e k )T(X,Y,m,m f ,e e )) , 

(a) 1 f f frn(zi)m(z 2 )K(zi,Z2) m (zi)d 2 n(zi, z 2 ) 

" ZiZ2\- 



^ 2 c k ciJe k Je t V m, 2 (z 1 )m 2 (z 2 ) m 2 (z 1 )m(z2) 

m(z 2 }diK(z 1 ,z 2 ) , dl 2 K(zt,z 2 )\ 

H 7 — \ — } — r )aziaz2 , 



m(zi)m 2 (z2) m(zi)m(z9.) < 
'As previously, we shall explicitly indicate the dependence on the contour 6^ if needed and write T(x, x' , u, u' , Ck 
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47T 2 N k N £ 



m A (z 1 )m A (z 2 ) 



dz\dz 2 



(10) 



where (a) follows from the fact that inf zge(c |m(V)| > together with Fubini's theorem, and 81,82, df 2 
respectively stand for 8/dz\, d/dz2 and d 2 /dz\dz2- 
By integration by parts, we obtain 



ziz 2 rn'(z 2 )diK(zi, z 2 ) 
m(z-\)m 2 (z9.) 

f ( z 2 m'(z 2 )K(zi,z 2 ) z 1 z 2 ri (z^m' (z 2 )k(zi, z 2 y . ^_ 

/ \ m{zi)ui 2 (z 2 ) m 2 (zi)m 2 (z 2 ) 

z 1 z 2 m(z 2 )d lt2 K(zi,z 2 ) 



Similarly, 



m(zi)m 2 (z 2 ) 



-dz\ 



z 2 d 2 K(z 1 , z 2 ) ^ + / Ziz 2 ml{zi)d 2 K{zi, z 2 ) ^ 



m(zi)m(z 2 ) J m 2 (z-\ )m(z?) 

Hence 

1 f 1 1 Z 2 m'(z2)K(z 1 ,Z 2 ) , f f Z 2 &2K(zi,Z 2 ) 

®ki = ~~r~2 \ f f ? \ 21 \ dz!dz 2 - <p <t> — ———dz 1 dz2 

47T z c k ci [Je k Je e m{zi)mr(z 2 ) J £k J et rn{zi)rn(z 2 ) 

Another integration by parts yields 

Z 2 dM Zl ,Z 2 ) dz2 = _l _J<ZuZ2)_ dz2 + f Z2^(z2Mz 1 ,Z 2 ) dz2 _ 



m(zi)m(z2) J rn(zi)m(z2) J m(zi)m 2 (z2) 

Finally, we obtain: 



1 1 1 K ( Z 1,Z2) , , 

Bjw = -T-5 <f f —, — \ — r ^dz 1 dz 2 

^ 2 c k ci J e J e m{zi)m{z 2 ) 



and Q is established. 



IV. Estimation of the covariance matrix 

Theorem [3] describes the limiting performance of the estimator of Theorem [TJ with an exact charac- 
terization of its variance. Unfortunately, the variance depends upon unknown quantities. We provide 
hereafter consistent estimates for based on the observations Rjy. 
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e 



k(' 



M 2 

N k N e 



E 



. (Ai - Aj) 2 ™4 (Ai) 



ml (A,) 2 ' 

— . iiiv 



(11) 



Theorem 4: Assume that the assumptions (A1)-(A2) hold true, and recall the definition of 0^ given 
in (O. Let Q^i be defined by (flTT) . where (3%) and (/ifc) are defined in Theorem Q] then: 

as N, M -)■ oo. 

Theorem 0] is useful in practice as one can obtain simultaneously an estimate p k of the values of 
as well as an estimation of the degree of confidence for each p k . 

Proof: In view of formula (O, and taking into account the fact that and m'~ are consistent 

estimates for m and m', it is natural to define by replacing the unknown quantities m and m' in $7} 
by their empirical counterparts m R and m'~ , hence the definition of 0^ in ( fTOt - 

The proof of Theorem @] now breaks down into two steps: The convergence of 0^ to Q^, which relies 
on the definition (fTOt of Q^e and on a dominated convergence argument, and the effective computation 
of the integral in ( fTOl ) which relies on Cauchy's residue theorem ETTl . and yields (ITTb . 

We first address the convergence of Q^t to 0^. Due to [18], |[22l . almost surely, the eigenvalues of 
R N will eventually belong to any e-blow-up of the support S of the probability measure associated to m, 
i.e. the set {i£R: d(x,§) < e}. Hence, if e is small enough, the distance between these eigenvalues 
and any z G will be eventually uniformly lower-bounded. By lfl4l Lemma 1], the same result holds 
true for the zeros of m R (which are real). In particular, this implies that m R is eventually uniformly 
lower-bounded on (if not, then by compacity, there would exist zGCj. such that m R (z) = which 
yields a contradiction because all the zeroes of m R are strictly within the contour). With these arguments 
at hand, one can easily apply the dominated convergence theorem and conclude that a.s. Q^e ©fc£- 

We now evaluate the integral (fTOl) by computing the residues of the integrand within and C^. There 
are two cases to discuss depending on whether k ^ I and k = £. Denote by h{z\^z-i) the integrand in 
(fTOl ), that is: 
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/ m' A («i)m' ft (z 2 ) 1 \ 1 
h(z 1 ,z 2 )=[- =f— ~" ; \ \ 2 ~? x r~\ 1 — V (12) 



We first consider the case where k 7^ I. 

In this case, the two integration contours are different and it can be assumed that they never intersect 
(so it can always be assumed that z\ / z 2 ). Let z 2 be fixed, and denote by fa the zeroes (labeled in 
increasing order) of , then the computation of the residue Res(h(-, z 2 ), fa) of h(-,z 2 ) at a zero fa 
of m-n which is located within is straightforward and yields: 

—N 

A /^R (Ai) m/ R ( Z 2 ) 1 \ 1 

r(z 2 ) = Res(/i(-,^),/ii) = ~ w 2 , =f ^ ; 7^ . (13) 

Similarly, if one computes Res(r, jlj) at a zero fij of located within one obtains: 

Then we need to consider the residue £ on the set 3? 22 = {zi : m-^ (zi) = (^2) / 0, zi / 22}- 

(If this set is empty, then the residue is zero.) Notice that £ is not a residue of -, — -, — \ -, — T , 

hence one needs to compute 

"4 Oi) m ' R (^) 

g{zi,z2) 



(m^Jz!) - m^ N ( Zl )f m^ N ( Zl )m^ N {z 2 ) 
for the residue £. By integration by parts, one gets 

I ( u / m/ R w (zi)m ^ (z2) <**i 

* 5(2:1, z 2 )dzi = - * =— = — ——- 2 — . 

/ / ( m R„( Z l) - m R N ( Z 2j)^ N Ul)^^) 

Let fc = min{i G N* : rr$ (£) 7^ 0}, then by a Taylor expansion 

«%,(*•) = ™a>> + ^nr^K) + 0(21 " & - 

km'~ (z 2 ) 

Res( 5 ,£) = 

mi (z 2 ) 

As it is the derivative function of 2mi k , y the integration with respect to z 2 is zero. 
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It remains to count the number of zeros within each contour. By [14, Lemma 1], eventually, there are 
exactly as many zeros as eigenvalues within each contour. It has been proved that the contribution of the 
residues of £ on 3? 22 is null, hence the result in the case k / £: 

^ 2 y l 

We now compute the integral ( fTOl ) in the case where k = I, and begin by the computation of the 
residues at fa. The definition (fl3l) of r and the computation of Res(r, p,j) still hold true in the case 
where fij is within but different from fa. It remains to compute Res(r, fa). Taking z 2 — > we get: 

1 \ 1 



lim (z 2 - fa 



zl™S Z2 ^' \m'~ (fa)m A (z 2 )(fa-z 2 ) 2 l m'\ (fa)' 

i , i m£ (AO 



Finally, 



m' (Ai)"iR N ( z 2)(Ai-^2) 2 2 (fa)(z 2 -faf) 2mL 3 (fa 

m 'Lifai) 



lim (z 2 - fa) [ — - — 7 — ^ , 2 - w TTq + 

Zl^t^i 



m'- {fa)m A (z 2 )(fa-Z2) 2 m'i (Ai)( z 2 - Ai) 3 2m /3 ^ (fa)(z 2 - faf 

6m' A (fa) 3 4m' A (fa) 4 ' 
Hence the residue: 

m'£ (fa) m'4 (fa) 2 

Kes(r, fa 



i) 



6m' R (Ai) 3 AmL (fa 

There are two other residues that should be taken into account for the computation of the integral: The 

residues of £ on Jl Z2 , and the residue for z\ = z 2 . The first case can be handled as before. For z\ = z 2 , 

the calculus of g(z\,z 2 ) for the residue z\ = z 2 is exactly the same as before. It remains to compute 
i l 

{z 1 -z 2 ) 2 m A ( Zl )m A (z 2 ) 



1 1 for the residue z\ = z 2 . The integration by parts yields that: 



1 



dzt f m L (Zl) dzi 



(z 1 -z 2 ) 2 m^ (zi)m-z (z 2 ) J (z\-z 2 )m\ (zi)m^ (z 2 )' 



Then the residue for z\ = z 2 is: 

mL (z 2 ) 
ml (z 2 )' 

Again, this is the derivative function of 2m ^ 1 ( Z2 ) > ^ en the integration is zero. 
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Finally both have a null contribution, hence the formula: 



e 



kk 



(k - Ai) 2 "4 (Ai) 

XV at XX at 



+ E 



m 



R 



(£») 



m 



R 



(/ij) 3 (&) 



V. Performance in the context of cognitive radios 

We introduce below a practical application of the above result to the telecommunication field of 
cognitive radios. Consider a communication network implementing orthogonal code division multiple 
access (CDMA) in the uplink, which we refer to as the primary network. The primary network is 
composed of K transmitters. The data of transmitter k are modulated by the orthogonal iV-chip 
codes Wjt i, . . . , Wfc „ fc G C^. Consider also a secondary network, in sensor mode, that we assume 
time-synchronized with the primary network, and whose objective is to determine the distances of the 
primary transmitters in order to optimally reuse the frequencies used by the primary transmitter^. From 
the viewpoint of the secondary network, primary user k has power Then, at symbol time m, any 
secondary user receives the iV-dimensional data vector 

k=l j=l 

with G an additive white Gaussian noise CN(0, <7 2 I) received at time m and x™) G C 

the signal transmitted by user k on the carrier code j at time m, which we assume CN(0, 1) as well. 
The propagation channel is considered frequency flat on the CDMA transmission bandwidth. We do not 
assume that the sensor knows a 2 neither the vectors w^, The secondary users may or may not be aware 
of the number of codewords employed by each user. 
Equation (fT4l can be compacted under the form 

y (m) = W p| x M + an (ra) 

with W = [wi,!, . . . , wi )Kl , w 2j i, . . . ,VfK,n K ] € C Nxn , n = Y,k=i n ^ p e < & nxn the diagonal matrix 
with entry Pi of multiplicity m, P2 of multiplicity ri2, etc. and Pr of multiplicity nx, and x^" 1 ) = 
[x^ T , . . . , x^ T ] T S C n where xjj. G C nk is a column vector with j-th entry x^ 1 - . 

2 the rationale being that far transmitters will not be interfered with by low power communications within the secondary 
network. 
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Gathering M successive independent observations, we obtain the matrix Y = [y^\ . . . ,y^ M ^] € 
,JVxM giyen by 

X 

WPt al N 

N 



Y = WP^X + ctN 



where X = [ X W , . . . , x( M )] and N = [nW , . . . , n( M )]. 

The y( m ) are therefore independent Gaussian vectors of zero mean and covariance R = WPW fi + 
(J 2 In- Since the objective is to retrieve the powers P k , while a 2 is known, the problem boils down to 
finding the eigenvalues of WPW fl + cj 2 In- However, the sensors only have access to Y, or equivalently 
to the sample covariance matrix 



RiV a _}_yy h = — Y yWyHff. 



m=l 

Assuming Rjy conveys a good appreciation of the eigenvalue clustering to the secondary user (as in 
Figure [lj, Theorem Q] enables the detection of primary transmitters and the estimation of their transmit 
powers Pi, ... , Pk', this boils down to estimating the largest K eigenvalues of WPW fl + cx 2 lAr, i.e. 
the Pk + a 2 , and to subtract a 2 (optionally estimated from the smallest eigenvalue of WPW^ + cr 2 Ijv 
if n < N). Call P k the estimate of P k . 

Based on these power estimates, the secondary user can determine the optimal coverage for secondary 
communications that ensures no interference with the primary network. A basic idea for instance is to 
ensure that the closest primary user, i.e. that with strongest received power, is not interfered with. Our 
interest is then cast on Pk- Now, since the power estimator is imperfect, it is hazardous for the secondary 
network to state that K has power Pk or to add some empirical security margin to Pk- The results of 
Section [III] partially answer this problem. 

Theorems [3] and [4] enable the secondary sensor to evaluate the accuracy of P k . In particular, assume 
that the cognitive radio protocol allows the secondary network to interfere the primary network with 
probability q and denote A the value 

A ± M{F(P K - P K > a) < q}. 

a 

According to Theorem[3j for N, M large, A is well approximated by @k i kQ~ 1 {q), with Q the Gaussian 
cumulative distribution function. If the secondary users detect a user with power Pk, estimated by Pk, 
P(Pk + A < Pk) < q and then it is safe for the secondary network to assume the worst case scenario 
where user K transmits at power Pk + A ~ Pk + QkQ 1 {(1)- 

In Figure |21 the performance of Theorem [3] is compared against 10,000 Monte Carlo simulations of 
a scenario of three users, with Pi = 1, P 2 = 3, P3 = 10, n\ = ri2 = = 20, N = 60 and M = 600. 
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1 1 Histogram of the 

Theoretical limiting distribution 



3 



2 



(1 



1 




3 



10 



Estimates 



Fig. 2. Comparison of empirical against theoretical variances, based on Theorem [3] for three users, Pi = 1, P 2 = 3, P3 = 10, 
ni = 712 = n 3 = 20 codes per user, N = 60, M = 600 and SNR= 20 dB. 

It appears that the limiting distribution is very accurate for these values of N, M. We also performed 
simulations to obtain empirical estimates of 0^ from TheoremHl which suggest that is an accurate 
estimator as well. 



In this article, we derived an exact expression and an approximation of the limiting performance of 
a statistical inference method that estimates the population eigenvalues of a class of sample covariance 
matrices. These results are applied in the context of cognitive radios to optimize secondary network 
coverage based on measures of the primary network activity. 



A. Proof of Proposition [7] 

Let us first begin by considerations related to the supports of the probability distributions associated 
to m(z) and mjv(z). Denote by S and §n these supports and recall that S is the union of L clusters: 



VI. Conclusion 



Appendix 



S = (ai,6i)U---U(oL,6 L ) . 



The following proposition clarifies the relations between S^r and S. 
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Proposition 3: Let N, M — >• oo, then for N large enough, the support §>n of the probability distribution 
associated to the Stieltjes transform mniz) is the union of L clusters: 

S A r = (af,6f)U---U(af,6^). 

Moreover, the following convergence holds true : 

N i N i 

a £ > at , b e > b e , 

N,M^oo N,M^oo 

for 1 < I < L. 

Remark 3: If the support Sn contains zero, (ex: N > M), then zero is also in the support §, the 
conclusion is still true. 

Proof of Proposition^ Recall the relations: 

^» = -( 2 -s/iriw rfFR " (t) )" <15) 

and 

. . M ( M\ 1 

m N {z) = —m N (z) - 11 - — j -. (16) 

As the inverse of Stieltjes transform of — - is 8q (the Dirac mass on 0) and m^{z) is a continuous 
function over W + , for a,b with < a < b, by the inverse formula of Stieltjes transform, one gets: 

F N ([a,b]) = ^F N ([a,b}). 

So it suffices to study the support associated to F_ N . 

From the definition of m N (z) ( see formula (031)). we obtain: 



, 1 N f tdF R ™(t) 

Z * AmN) = -^ + Mj l + tm N (zy 



Denote by B = {m G R : m / 0, -mT x £ {pi,-- - ,Pl}}- In HU Theorem 4.1 and Theorem 4.2], 

& e B and 4t N (™J 



Silverstein and Choi show that for a real number x, x G S^r <^=^ m G B and z4> (?rO ~~ ! 



5 / tS^Ef > with — = — 1 and zr ~(^) = x - 

Then if a G <9§tv, m a ^ B or z^ n (?7i a ) < with m a = m N (a). Now we will show that m a G i?. In 
Theorem 5.1], m a / 0. If — m™ 1 G SV r n, as F Rn is discrete, we get that lim m _j. ma J 1 ^_ t ^P — > oo. 
So on the neighborhood to the left and to the right of m a , < which contradicts ll23l Theorem 5.1]. 

Hence (m a ) < 0. By the continuity, we get 
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1 N f t 2 dF R «(t) _ n 
^ N " m 2 M J (l+tm a y~ U - 

It is equivalent to the following equation: 

11^ ? 

4 N K) = — " T7 E N i (,, P ' ^2 = °- (17) 
m£ M f-j' (1 + PiTXlaY 

By multiplying the common denominator, one will get a polynomial of the degree 2L in m a . Now we 
will show that these 2L roots are real. At first, notice that 

1 N ft 2 dF RN (t) 

1 > -oo, 



m 2 M J (l + tm) 2 m^-j 
and 



„ , . 2 N [2t 6 

z" (m) = H / — 

RnV ^ m 3 M J (1 



2t 3 dF RN (t) 



+ im) 



3 



So 4 N (?n) has one and only one zero in the open set (— i,— — ^-) for i G {1, • ■ • , L — 1}. Then 
for Pi G (— J - ) - ^77) sucn th at Z R N (/^*) = 0> it suffices to show that £r, n (A) > in order to prove 
that there will be two zeros for z^ N (m) in the set (— j-j—j^)- From the separability condition (cf. 
Assumption (A2)), mf N {j^ - ^(i)} > 0, and 

1 2 N f t 2 dF R "(t) 
Z ^ [ ~W~ ai ~M J (l- X)2 



2 



Thus we obtain 2(L — 1) roots. Besides, in the open set (— p~£ ,0), 

1 iV /" t 2 dF RN (t) 

m 2 ~ M J (1 + tm) 2 m a ^o- +OC 

there exists another root in this set. In the open set (— 00, — p^ 1 ), 

1 N ft 2 dF RN (t) 

1 > 



in 



M I (1 + tm) 2 m-¥-oo 



N [t 2 dF R »(t) 1 . (1 _A) >0 . 



and 

1 

m 2 M J (1 + tm) 2 m 2 v " M 

Hence the last root in this open set. This proves that §n = (a>\ , &i ) U • • • U (a^ , b^). 

To prove > ap and > bp , notice that fej satisfy the same type of the equation 

by replacing || by c and F Rn by F R . As |j — >• c and — >■ Cj, the roots of Equation (flTl) converge to 
those of the limit equation (see O). Thus we achieve the second conclusion. 
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We are now in position to establish the proof of Proposition Q] 

Denote by §(e) the e-blow-up of S, i.e. §(e) = {x G M., d(x,S) < e}. Let e > be small enough 
and consider a smooth function <fi equal to zero on S(e/3), equal to 1 if x ^ S(e), equal to zero again 
if | a; | > r (as we shall see, r will be chosen to be large), and smooth in-between with < (j) < 1: 



0(s) 



if d(x,§) < e/3 , 

1 if <i(x, S) > e , |x| < r — e 
if \x\ > t . 



Notice that if N, M — > oo and N is large enough, then by Proposition [3j (f>(x) = for all x £ Sjy. 
Now if Z is a M x M hermitian matrix with spectral decomposition Z = Udiag (7^ 1 < i < M)) \J H , 
where U is unitary and diag (7$; 1 < i < M)) stands for the M x M diagonal matrix whose entries are 
Z's eigenvalues, write 0(Z) = Udiag (0(7*); 1 < i < M)) XJ H . 
We have: 



RtvI 


> r 


-e) 


+ 


Rat 


> r 




+ 


RjvI 


> r 


-e) 


+ 



(a) 

< PfllRvll >r-el + EPTr (f>fRjv)F 



for every p > 1, where (a) follows from Markov's inequality. The fact that P(||Rjy|| > r) = 0(A^ 
for r large enough and every I G N* is well-known (see for instance [6, Section 9.7]). We shall therefore 
establish estimates over E[Tr </>(Rtv)] p - Take p = 2 k ; we prove the following statement by induction: For 
k > 1 and for every integer /3 < 2 k and for every smooth function / with compact support whose value 
on S(e/3) is zero , 



First notice that due to Proposition [3j f(X) F/v(dA) = (where F/v is the probability distribution 
associated to uin) for N, M large enough (N, M — > 00). A minor modification of [25, Lemma 2] (whose 
model is slightly different) with the help of |26l Proposition 5] yields that for N, M — > 00 and N large 
enough, ETY/(Rtv) = 0(A^ _1 ), and the property is verified for k = 0. 



DRAFT 



August 29, 2011 



21 



Let k > be fixed and assume that the result holds true for f3 < 2 k . We want to show that 
E[Tr/(R Ar )] 2(t+1) = 0(N- 213 ). At step k + l, the expectation writes: 



E[Tr/(Rjv)] 



2( fe + 1 ) 



= E ([Trf^)] 2 ' + E[tr/(R^)] 2 ' - E[Tr/(R^)] 2 
< 2 (Var[Tr f(R N )f + \E[Tr f (K N )f | 2 



(18) 



(19) 



The second term of the right hand side (r.h.s.) of the equation can be handled by the induction hypothesis: 



E[Tr/(R 7V )] s 







1 



N 2 ? J ' 

We now rely on Poincare-Nash inequality (see for instance l(26l Section II-B]) to handle the first term of 
the r.h.s. Applying this inequality, we obtain: 



Var((Tr/(R iV )) 2fc ) < 



E 



d[Trf(R N )f 



dYi 



+ 



d[Tif(R N )] 



2 h 



dY 



(20) 



where K is a constant which does not depend on N, M and which is greater than R^r's eigenvalues. In 
order to compute the derivatives of the r.h.s., we rely on ll27l Lemma 4.6]. This yields: 



d 



d 



dY. 



[Tr f(R N )f 
[TrfCR N )f 



-[TV f(K N )f '^[Yfr/'OMi,* , 
^[Tr/(R^)] 2t - 1 [/'(R JV )Y w ] M . 



Plugging these derivatives into (1201 . we obtain 



Var(Tr[/(R iV )] 2 ") 
< — — - — E 

E 



< 



M 2 

^ 2 2fc+l 

M 
K2 2k + 1 
M 



(Tr/(R iV ))( 2fc+1 - 2 )Tr(/ / (R i v)Y w Y^/'(R J v)) 
(Trf(R N ))( 2k+1 -VTr(f'(R, N ) 2 R N 

2 k+l_ 2 

E[Trf(R N )f +1 x E[Tr/'(Rjv) 2 lW 



where the last inequality is a consequence of Holder's inequality. 

As the function h(X) = A[/'(A)] 2 satisfies the induction hypothesis, we have for every a < 1: 



ETr[/'(R^) 2 R^] 2t 



Q(AT 



Plugging this estimate into (1181) . we obtain 



E[Tr/(R^)] 



2 (fc+i) 



< K 



' \E[Trf(R N )f h+1) \^)+0(N-W), 



(21) 
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where K is a constant independent of M, N, k. Notice that inequality (f2TT > involves twice the quantity 
of interest E[Tr f (lijsr)] 2 h+1 that we want to upper bound by 0{N~ 2 ^). We shall proceed iteratively. 
Notice that Tr [/(Rjv)] < sup xgM \ f{x)\ x N because / is bounded on R; hence the rough estimate: 

E[Tr/(R,v)] 2<fc+1) =0(iV 2fc+1 ). 

Plugging this into (1211) yields: 

E[Tr/(R^)] 2<fe+1) = 0(iV ai ) , 

where ao = 2 k+l and a\ = ao 2 ^+T 2 — (1 + a )- Iterating the procedure, we obtain: 

E[Tr f(k N )f +1) = Q ( N "M-m} f 

where ag = ae-i ^-ft^ — (1 + a) and x\/ y stands for sup(x, y). Now, in order to conclude the proof, it 
remains to prove that i) the sequence (at) converges to some limit Oqo, ii) for some well-chosen a < 1, 

aoo G (-2 fc+1 ,-2/3). Write: 

2 fc — 1 

a m + 2 fc (l + a) = ^^(a £ + 2 fc (l + a)) , 

hence converges to — 2 k (l + a) which readily belongs to (— 2 k+1 , —2/3) for a well-chosen a G (0, 1). 
Finally E[Tr /(Rat)] 2 '^ 1 ' = 0(7V~ 2/3 ) which ends the induction. 

It remains to apply this estimate to E[Tr 0(Rat)] £ in order to get the desired result. 

B. Proof of Lemma [7] 

As explained in Section [Till there are two conditions to prove (Billingsley ll28l Theorem 13.1]): 

• Finite-dimensional convergence of the process (Xn,X' n ). 

• Tightness on the contour Cfc. 

Remark 4: As un (resp. u' N ) converges almost surely to u (resp. yf) (see Silverstein and Bai ifTTlO . 
the convergence of the process (-Xjv, X' n ,un,u' n ) is achieved as soon as the convergence of the process 
(Xn,X' n ) is proved. 

In ll20l . Bai and Silverstein establish a central limit theorem for F Rjv with the complex Gaussian 
entries Xy. We recall below their main result. 

Proposition 4: [20] With the notations introduced in Section JIJ for /i, . . . , f p , analytic on an open 
region containing R, 

1) [N J fi(x)d(F RN - F N )(x)) forms a tight sequence on N, 

\ / l<i<p 

DRAFT August 29, 201 1 



23 



2) 



(n [ fi(x)d(F^ - F N )(x)) A H(0, V) 

V J J l<i<p 



where V = (Vij) and 



with 



V ij = ~4~2 $ $ fi{zi)fj(z 2 )v ij (z 1 ,z 2 )dz 1 dz 2 , 



rn'(z 1 )m'(z 2 ) 1 

Vij{zi,z 2 ) - 



(m(zi) - rn{z 2 )) 2 (z 1 - z 2 ) 2 
where the integration is over positively oriented contours that circle around the support S. 

Now we apply this proposition to show the finite-dimensional convergence. For all Z{ G Cfc\M, notice 
that 



m A (z) - m N (z) = — <p - -d{F^ - F N ){x) 



2m J x — z 

with the contour who contains the support S and X^{z) = M(m^ (z) —m N {z)). Then Proposition [4] 
implies directly that for all p G N, the random vector 



\X N {z 1 ),X' N (z 1 ), ■ ■ ■ ,X N (z p ),X N (z p )^ 



converges to a centered Gaussian vector by considering the functions: 

fi(x) = — - — , f 2 (x) = — r 2 , • • • , /ap-i(x) = — - — , h P {x) = 3—7 

x — z\ [x — z\y x — Zp (x — z p y 

Thus the finite dimensional convergence is achieved. 

The proof of the tightness is based on Nash-Poincare inequality ([25] and [26]). In Appendix lAl it is 

proved that for all e > and all I G N, 

P ( sup d(A,S) > e J = o(N- £ ). 

Following the same idea as Bai and Silverstein [20, Section 3 and 4], it is indeed a tight sequence. The 
details of the proof are in Appendix [C] Thus Lemma Q] is achieved. 



C. Proof of the tightness 

We will show the tightness of the sequence M(mA — m/v) an d M(m& — ELn) ^ using Nash- 

TV — iV 

Poincare's inequality [26]- First, denote by M(m^ (z) - m N (z)) = M^(z) + M^(z) with M^(z) = 
M(m ft (z)-E[m fl (z)}) and M^{z) = M(E[m^ (z)] - m N (z)). 
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As can converge to infinite if z is close to the real axis, there will be a little trouble for the 

pk-z b ' 

tightness. Then we need a truncated version of the process. More precisely, let ejy be a real sequence 
decreasing to zero satisfying for some 5 g]0, 1[: 

e n > N~ s . 



Remark 5: Notice that Xjy(z) = Mlm-n — m N ) = Xjy(z) for z G C+. So it suffices to verify the 

—N 

arguments for z £ C + . 

Denote by ([x2fc-i, x 2 k], k = 1, • • • , L) the k-th cluster of the support of the limiting spectral measure; 
and take hk-iMk such that x 2 k-2 < hk-i < %2k-l and x 2 k < hk < %2k+i for k £ {1,..,L} with 
conventions xq = and x 2 l+i = oo, i.e. , [hk-li hk] OIU y contains the k-th cluster. Let d > 0. Consider: 



and 



Also 



C u = {x + id : x G [fcft-i, fefc]}- 
CV = {Z 2 fc-l + it> : w G [^ _1 £JV, d}}. 



Q = {hk + iv ■ v £ [N 1 e N ,d]}. 
Then Cat = C\ U C u U C r . The process Mjy(-) is defined by 

Mlf{z) for 2 G Cat, 

M Ar (/ 2fe + iJV- 1 ^) for x = l 2k , v £ [0, N^en], 
M l N {l 2k ^ + iJV^ejv) for x = l 2k ^,v £ [0, N^en]. 
This partition of Cn is identical to that used in [20l Section 1]. With probability one (see lfT8l and 112210 . 
for all e > 0, 

lim sup d(A,§Ar) < e 

Aeeig(R N ) 

with d(x, S) the Euclidean distance of x to the set S. So with probability one, for all N large, (120, page 
563]) 



M 1 N (z)-M 1 N (z))dz 



and 



Ml/(z)-M%(zj)dz 



< K X £ N , 



< K 2 E N 



for some constants K\ and K 2 . Both terms converge to zero as M — > oo. Then it suffices to ensure the 
tightness for M^(z) and M^(z). 
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We now prove tightness based on 11281 Theorem 13.1], i.e. 

1) Tightness at any point of the contour (here Cn)- 

2) Satisfaction of the condition 

E|(Mjy(zi) - M^(z 2 ))| 2 

SUp : 12 S -ft- 

N,z u z 2 eC N \zi - z 2\ 

Condition 1) is achieved by an immediate application of Proposition [4] We now verify the second 
condition. 

We evaluate mHM-Mh^W N & ^ 
21-22I 2 

_ ^ N 1 

m R (^l) — m R (^2)= 7T / — 



_ zi - z 2 
M 

with Dn(z) = — zljv- We have 

dYij \ z\ - z 2 J 

' } Tr(R N - z 1 I)- 1 (K N -z 2 I)- 1 



Tr(D^(^l)D^(^)) 



and 



± [-Y^D^( 2l )D^(z 2 )- YfcD^iP* teO]^. 



5 /™R „(*l) "™R 



y zi - z 2 J 
= jj[-Bx\z 1 )B],\z 2 )Y N -B N \z 1 )n N \z 2 )Y N ] i j. 

Then by the Nash-Poincare inequality and the fact that Rjv is uniformly bounded in spectral norm almost 
surely, one gets 

E|Mi(zi)-Mi(«a)| 2 / Ci. 



< — E 

zi -z 2 2 " AT 



TrfL 



IV, 



= § E ( Tr ( L ^) I sup„^, S )<J + § E ( Tr ( L ^) I sup^(A„, S )>J 

with 

L N = n N -D N 4 ( Zl )B N 2 (z 2 ) + 2R iV D- 3 (z 1 )D w 3 (z 2 ) + RjvD^ 2 (zi)D" 4 (z 2 ) 
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and C\ a constant which does not depend on N or M. For the first term, Tt(Lat) is bounded on the set 



sup n d(X n , S) < e. For the second term, since for all % G N and all z G CV, 



<jr,\t leads that 



\X n -z\ 



N 



n=l l A n - Z| 



< 



Then 



Tr(L N )\ < 



TV 7 



As P(supd(A n ,S) > e) = o(N- lb ), take e N = N~ om , one obtains 



E(Tr(L Ar )I 



sup n d(A„,S)>e' 



< E 

< 

< (N 



N 7 



\supd(X n ,§) > e] 



7-0.06-16^ 



0. 



The second condition of tightness is achieved. 

For Mfj{z), following exactly the same method in (6j Section 9.11], one can show that M^(z) is 
bounded and forms an equicontinuous family that converges to 0. Hence the tightness for M(m^ (z) — 
m N (z)). 

The next step is to prove the tightness of M(m'~ (z) — rn! N {z)). We have 
mi (zx)-rnL (z 2 ) 

N 



M 



E 



2Aj — z\ — z 2 

{ (A; - Z X f(\i - Z 2 f 



z\ - z 2 
M 



Tr (D^ 2 (z 1 )D^ 2 (z 2 )(D iV (z 1 ) + D ff (z 2 ))) . 



Following the same method as derived before, one obtains 

d 



dYij 



~ [Y^B N \z 1 )B N 2 (z 2 ) + 2Y* N B N 1 (z 1 )B N 3 (z 2 ) 



and 



with 



(> ™ N 2 ( Zl )-D N 2 (z 2 )(B( Zl ) +-D N (z 2 )) 



dYij 



L 2 =4R^3D^ v 4 (z 1 )D^ 4 (z 2 ) + 2D^( 2l )D^(z 2 ) + 2D^(*i)D^(> 2 
+ D- 2 (^)D- 6 (.: 2 ) + D" 6 ^)^ 2 ^)) . 
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Then Nash-Poincare inequality yields that 

\M 1 N (z 1 )-M l N (z 2 )\ 



Var^ 

\zi - z 2 \ 

C C 
< — E(Tr(L 2 )I M i , w ) + — E(Tr(L 2 )I M \ « w ) 

with Ci the same constant defined as before. The term Tr(L 2 ) is bounded on the set supd(A n ,S) < e. 
For the second term, |Tr(L 2 )| < As P(supd(A n ,S) > e) = o(N~ l& ) and e^v = iV~ - 01 , the 

proof of the tightness of Mjy- (z) is achieved as before. 

The proof of the tightness is completed with the verification of M^'(z) for z G C n to be bounded 
and forms an equicontinuous family, and convergence to 0. We will use the same method for the process 
M%{z) (see Section 9.11]). 

By Formula (9.11.1) in Section 9.11], they show that 

JV r m N t 2 dF R N (t) \ 
M J 7T+tIm~ )(l+tra w ) \ 

( Ero R„ " Sat) I 1 " — -TTli^T- = (22) 



where 

TV - 
i=i 

dj. = ^-(z) = -q^R^^R^ _ 2l) -1 (Em^ R + I) -1 ^/ 2 ^- + (l/M)Tr{Em A Yl + I)- 1 R(R JV - zl) 

R a) = Riv- ^y;y]- 

If one derives ((221) with respect to z, the equation becomes 

/ TV r m N t 2 dF R K{t) \ / JV r m N t 2 dF R ™{t) 

( Em L -Sat) 1 " - , jy r ^ ^ + ( Em L ~ZSat) 1 



JV r tdF^N _ m I ' \ ii v — « ' I - _ , JV f tdF R N _ T 
r M J 1+fflm*.. " / \ z_l "AfJ l+fflm 4 N 



= Em'^ ELnTn + Em^m' N Tpf + Em-^^rn N T' N . 
In the work of |6, Section 9.11], they show that when TV tends to infinity, 

!) sup zeCjv \Em^Jz) -m(z)\ -> and sup zeCiv Im/vO*) - ZZ»(«)| -> 0, 

N_ f t 2 rn, N dF R N (t) 
M J (l+«Em ft )P+t!» W ) 

2) 7 V converges , 

~ Z+ M J l + t E mft Jj V 

3) M^) -> O^Tjv -> 0. 
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With the same method, one can show easily that 

4) sup, eejv \Em'- (z) - m'(z)\ -> 0, 

5) sup^ eew \m' N (z) -m'(z)\ -»• 0, 

6) ^(E^iW/)' converges. 

With these results, it suffices to show that T' N — > 0, and M^' is equicontinuous. 

In |]6j Section 9.9], they show that for m,p 6 N and a non-random N x N matrix k = 1, .., m 
and £ = 1, .., g, we have 



m 9 

<KM-^l[\\A k \\l[\\B e \\. (23) 

fe=l £=1 

We have also that for any positive p, 




m ax(E||D- 1 (z)|r,E||DT 1 (z)r,E||D 4 r. 1 (z)f) < K p (24) 

and 

sup II (Em* (z)R + I)" 1 || < oo (25) 

n, 2 ee„ — " 

where is a constant which depends only on p. 

With all these preliminaries, as Tjy — » 0, by the dominated convergence theorem of derivation, it 
suffices to show that T' N is bounded over Cat. In (6j Section 9.11], it is sufficient to show that (f' M (z)) 
is bounded where 



M 

Im(z) = ^E[(rp- 1 r J -M- 1 TrDT 1 R)( r pT 1 (Em^R+I)- 1 r J -Af- 1 TrDT 1 (Em^R+I)- 1 R)]. 

3=1 

With the help of (|23])-(|25]), f' M (z) is indeed bounded in Cat. 

Now we will show that Mjy is equicontinuous. With the light work as before, it is sufficient to show 
that /'^(z) is bounded. Using d23l , we obtain 
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1/2 



1/2 



1/2 



1/2-, 



Thanks to (l24l ) and d25l ), the right side is indeed bounded. This ends the proof of the tightness. 
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